Meta plans to release an initial version of its next-generation Llama 3 large language model within the next month. The company will release a number of different models with different capabilities and versatilities during the course of the year. Llama 3 will be able to answer a wider range of questions compared to its predecessor, including questions regarding more controversial topics. Meta has not released any details about the model's size, but it is expected to have about 140 billion parameters - the biggest Llama 2 model has 70 billion.
Wednesday, April 10, 2024This blog post from Meta outlines the infrastructure being used to train Llama 3. It talks through storage, networking, Pytorch, NCCL, and other improvements. This will lay the foundation for Meta's H100s coming online throughout the rest of this year.
Wednesday, March 13, 2024Meta has confirmed plans to release Llama 3, the next generation of its large language model for generative AI assistants, within the next month.
Meta has confirmed plans to release Llama 3, the next generation of its large language model for generative AI assistants, within the next month.
Meta plans to release an initial version of its next-generation Llama 3 large language model within the next month. The company will release a number of different models with different capabilities and versatilities during the course of the year. Llama 3 will be able to answer a wider range of questions compared to its predecessor, including questions regarding more controversial topics. Meta has not released any details about the model's size, but it is expected to have about 140 billion parameters - the biggest Llama 2 model has 70 billion.
OpenAI and Meta are teasing the next iterations of their AI models, expected to feature enhanced reasoning and planning capabilities. Dubbed GPT-5 and Llama 3, the models aim to advance toward artificial general intelligence, with vague release timelines and application details. The tech community remains skeptical given the history of overhyped AI promises with limited substantive evidence.
Meta's AI assistant is being integrated into search boxes within its apps. It will start appearing directly in the main Facebook feed and users can chat with it using Meta's messaging apps. The assistant is also accessible via a standalone website. It runs on Meta's new Llama 3 model, which outperforms competing models of its class on key benchmarks. The assistant integrates real-time search results from both Bing and Google and can generate images.
Meta has released an 8B and 70B model with dramatically improved performance, particularly in reasoning, context length, and code. It is still training a 400B parameter model, which will match Opus in performance. These models are easily the most powerful available open models.
Meta has released Llama 3, an open-source LLM. It performs better on many benchmarks - its various-sized models have similar or better performance compared to Google's, Anthropic's, and Mistral's models.
While this tune performs worse than Hermes, it is likely due to the small dataset used. Otherwise, this is a great walkthrough on how to tune these models to improve desired performance on certain tasks.
The original Llama models had significantly more incorrect refusals. This brief blog shows some examples that previously failed.
Llama 3 has been praised as the best free large language model. Meta is demonstrating its capabilities with real-time AI image generation in WhatsApp. The company says Llama 3 produces sharper and better images than Llama 2 with improved text rendering. Additionally, the model can animate images and create GIFs. Users with access to the beta version of Meta AI can now try the new features.